Time-Frequency Energy Features for Articulator Position Inference on Stop Consonants
Authors
Abstract
Acoustic-to-articulatory inversion offers new perspectives and interesting applications in the speech processing field; however, it remains an open issue. This paper presents a method to estimate the distribution of the articulatory information contained in the acoustics of stop consonants, which are parametrized using the wavelet packet transform. The main focus is on measuring the relevant acoustic information, in terms of statistical association, for inferring the position of the critical articulators involved in stop consonant production. The Kendall rank correlation coefficient is used as the relevance measure. Maps of relevant time–frequency features are calculated for the MOCHA–TIMIT database, from which stop consonants are extracted and analysed. The proposed method obtains a set of time–frequency components closely related to articulatory phenomena, which offers a deeper understanding of the relationship between articulatory and acoustic phenomena. These maps are tested in an acoustic–to–articulatory mapping system based on Gaussian mixture models, where they are shown to be suitable for improving the performance of such systems on stop consonants. The method could be extended to other manner-of-articulation categories, e.g. fricatives, in order to adapt the present method to acoustic-to-articulatory mapping systems over whole speech.
1 MSc in Automation, PhD candidate in Engineering (Automation), Universidad Nacional, Manizales, Colombia, [email protected].
2 PhD in Telecommunications, professor, Universidad Nacional, Manizales, Colombia, [email protected].
Universidad EAFIT
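The relevance measure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the wavelet family, the decomposition depth, the framing, or the variant of Kendall's coefficient, so the Haar filter, the two-level tree, the tau-a formula, and the function names `haar_packet_energies`, `kendall_tau` and `relevance_map` are all assumptions chosen for illustration.

```python
import math


def haar_packet_energies(frame, levels):
    """Energy of each terminal node of a full Haar wavelet packet tree.

    Assumption: Haar filters; len(frame) must be divisible by 2**levels.
    Nodes are returned in natural (filter-bank) order, not frequency order.
    """
    nodes = [list(frame)]
    for _ in range(levels):
        split = []
        for node in nodes:
            pairs = list(zip(node[0::2], node[1::2]))
            split.append([(a + b) / math.sqrt(2) for a, b in pairs])  # low-pass
            split.append([(a - b) / math.sqrt(2) for a, b in pairs])  # high-pass
        nodes = split
    return [sum(c * c for c in node) for node in nodes]


def kendall_tau(x, y):
    """Kendall tau-a: (concordant - discordant) / total number of pairs."""
    n = len(x)
    score = 0
    for i in range(n):
        for j in range(i + 1, n):
            prod = (x[i] - x[j]) * (y[i] - y[j])
            score += (prod > 0) - (prod < 0)  # tied pairs contribute 0
    return score / (n * (n - 1) / 2)


def relevance_map(frames, positions, levels=2):
    """Kendall tau between each band energy and the articulator position.

    frames: one acoustic frame (list of samples) per observation.
    positions: one articulator coordinate per observation (hypothetical data).
    """
    energies = [haar_packet_energies(f, levels) for f in frames]
    bands = zip(*energies)  # regroup: one energy series per band
    return [kendall_tau(list(band), positions) for band in bands]
```

For example, with toy frames whose constant amplitude grows with the articulator coordinate, `relevance_map([[p] * 8 for p in (1, 2, 3, 4, 5)], [1, 2, 3, 4, 5])` assigns full relevance to the first band and none to the others. A real pipeline would more likely use library routines for the wavelet packet transform and for a tie-corrected Kendall coefficient.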
Similar resources
Inference of Critical Articulator Position for Fricative Consonants
Inversion aims to estimate the articulatory movements that underlie an acoustic speech signal. Within the acoustic–to–articulatory mapping framework, time–frequency atoms have also been employed. The main focus of the present work is estimating the relevant acoustic information, in terms of statistical association, for the inference of the critical articulators' position; in particular, those involved on...
Determining articulator configuration in voiced stop consonants by matching time-domain patterns in pitch periods
Analysis of Stop Consonants in Indian Languages Using Excitation Source Information in Speech Signal
In this paper we propose excitation-based features for extracting information about the manner of articulation for stop consonants. The excitation-based features are derived from very low frequency information in the signal and also from the normalized error computed from the linear prediction residual. The proposed zero-frequency filtered signal brings out the region of glottal activity during...
Stop Consonant Classification Using Wavelet Packet Transforms and a Neural Network
A wavelet packet transform is described to compute N spectral/temporal features for the 6 English stop consonants /b,p,d,t,g,k/. These features were used by a Binary Pair Partitioned neural network for speaker-independent classification of the stop consonants. The wavelet packet transform is generated by a pair of quadrature mirror filters which decompose the signal into a series of subbands ("f...
In search of non-uniqueness in the acoustic-to-articulatory mapping
This paper explores the possibility and extent of non-uniqueness in the acoustic-to-articulatory inversion of speech, from a statistical point of view. It proposes a technique to estimate the non-uniqueness, based on finding peaks in the conditional probability function of the articulatory space. The paper corroborates the existence of non-uniqueness in a statistical sense, especially in stop c...